Part-Guided Attention Learning for Vehicle Instance Retrieval

نویسندگان

چکیده

Vehicle instance retrieval (IR) often requires one to recognize the fine-grained visual differences between vehicles. Besides holistic appearance of vehicles which is easily affected by viewpoint variation and distortion, vehicle parts also provide crucial cues differentiate near-identical Motivated these observations, we introduce a Part-Guided Attention Network (PGAN) pinpoint prominent part regions effectively combine global local information for discriminative feature learning. PGAN first detects locations different components salient regardless identity, serves as bottom-up attention narrow down possible searching regions. To estimate importance detected parts, propose Part Module (PAM) adaptively locate most with high-attention weights suppress distraction irrelevant relatively low weights. The PAM guided identification loss therefore provides top-down that enables attention be calculated at level car other Finally, aggregate features together improve performance further. combines part-guided bottom-up top-down attention, in an end-to-end framework. Extensive experiments demonstrate proposed method achieves new state-of-the-art IR on four large-scale benchmark datasets.1

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multiple-Instance Learning for Music Information Retrieval

•Multiple-instance learning (MIL) algorithms train classifiers from lightly supervised data – collections of instances, called bags, are labeled rather than the instances themselves – algorithms can classify bags or instances, we focus on instances •Motivation for applying MIL to MIR: – propagate metadata between granularities: artist, album, track, 10-second clip – e.g. train clip classifiers ...

متن کامل

Attention-based Deep Multiple Instance Learning

Multiple instance learning (MIL) is a variation of supervised learning where a single class label is assigned to a bag of instances. In this paper, we state the MIL problem as learning the Bernoulli distribution of the bag label where the bag label probability is fully parameterized by neural networks. Furthermore, we propose a neural network-based permutation-invariant aggregation operator tha...

متن کامل

Attention Guided Deep Imitation Learning

When a learning agent attempts to imitate human visuomotor behaviors, it may benefit from knowing the human demonstrator’s visual attention. Such information could clarify the goal of the demonstrator, i.e., the object being attended is the most likely target of the current action. Hence it could help the agent better infer and learn the demonstrator’s underlying state representation for decisi...

متن کامل

Learning Robust Hash Codes for Multiple Instance Image Retrieval

In this paper, for the first time, we introduce a multiple instance (MI) deep hashing technique for learning discriminative hash codes with weak bag-level supervision suited for large-scale retrieval. We learn such hash codes by aggregating deeply learnt hierarchical representations across bag members through a dedicated MI pool layer. For better trainability and retrieval quality, we propose a...

متن کامل

Optimization Strategies for Instance Retrieval

In this paper new techniques for optimizing instance retrieval in DL systems are described. The algorithms are evaluated with application examples from a natural language processing application.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Intelligent Transportation Systems

سال: 2022

ISSN: ['1558-0016', '1524-9050']

DOI: https://doi.org/10.1109/tits.2020.3030301